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1 Introduction 



The concept of causality defined by Granger (1969) is widely used to analyze cause 
and effect relationships between macroeconomic and financial variables (see e.g. 
Sims (1972), Ashenfelter and Card (1982), Hamilton (1983), Lee (1992), Hiemstra 
and Jones (1994), Renault and Werker (2005), Gelper and Croux (2007)). The 
Granger causality has also been studied in others areas : neuroscience (see e.g. 
Brovelh et al. (2004), Seth (2008)), gene networks (Fujita et al. (2009)), geophysics 
(Reichel, ThejU and Lassen (2001)), or sociology (Deane and Gutmann (2003)) are 
some application domains among others. Causality relationships are often analyzed 
by taking into account only the past values of studied variables. In many situations 
the prediction of the unobserved current variables X2t can however be improved 
by including the available current information of variables Xn. In such a case the 
instantaneous causality relation between Xn and X2t is investigated (see Liitkepohl 
(2005, p 42)). 

In the stationary VAR processes framework, the instantaneous causality is usu- 
ally tested by using Wald tests for zero restrictions on the innovation's covariance 
matrix. Standard tools available in the commonly used softwares (see Liitkepohl and 
Kratzig (2004)) are based on the assumption of i.i.d. Gaussian innovations. The 
weight matrix of the test statistic has to be corrected by using the White type covari- 
ance matrix when the error process is assumed i.i.d. but non Gaussian (see White 
(1980)). In some cases models which produce nonlinear stationary processes are 
considered for the error terms as the GARCH or All-Pass models (see e.g. Bauwens, 
Laurent and Rombouts (2006) or Andrews, Davis and Breidt (2006)). These models 
allow to take into account some dependence in the innovations but also suppose 
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that the unconditional variance of the innovations process is constant. In order to 
get a standard asymptotic distribution of the Wald test statistic in these situations, 
Heteroscedasticity and Autocorrelation Consistent (HAC) corrections can be used 
(see Den Haan and Lievin (1997) for the HAC estimation). 

Nevertheless many applied papers questioned the assumption of a constant un- 
conditional variance structure. For instance Sensier and van Dijk (2004) found that 
most of the 214 U.S. macroeconomic variables they investigated exhibit a break in 
their unconditional variance. Ramey and Vine (2006) highlighted a declining vari- 
ance of the U.S. automobile production and sales. Mc-Connell and Perez-Quiros 
(2000) documented a break in variance in the U.S. GDP growth and pointed out 
that neglecting non constant variance can be misleading for the data analysis. It 
emerges from these studies that processes with non constant unconditional vari- 
ance are common features in practice. All these observations led us to consider the 
case of instantaneous causality relationships where the unconditional variance of the 
structural innovations changes over time. 

Numerous tools for time series analysis in presence of non constant variance have 
been proposed in the literature. For instance Tsay (1988), Horvath, Kokoszka and 
Zhang (2006) or Sanso, Arago and Carrion (2004) proposed tests for detecting un- 
conditional variance changes in several situations. Kokoszka and Leipus (2000) and 
Dahlhaus and Rao (2006) studied ARCH processes with non constant unconditional 
variance. Robinson (1987), Hansen (1995), Francq and Gautier (2004) or Xu and 
PhiUips (2008) among other references investigated univariate linear models allowing 
for a non constant variance. Starica (2003) considered a deterministic non constant 
specification for the unconditional variance of stock returns, and noted that such an 
approach can perform as well as the stationary GARCH(1,1) model. Kim and Park 
(2010) studied cointegrated systems with non constant variance. Bai (2000), Qu and 
Perron (2007), and Patilea and Raissi (2012) among others investigated the estima- 
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tion of multivariate models with time-varying variance. Aue, Hermann, Horvath 
and Reimherr (2009) proposed a test procedure for detecting variance breaks in 
multivariate time series. 

In this paper we focus on the test of zero restriction on the time- varying variance 
structure. We highlight that the standard Wald test for instantaneous causality im- 
plemented in the commonly used softwares does not provide suitable critical values 
when the variance structure is time-varying. It is also established that the tests 
based on White or HAC corrections of the Wald test statistic can suffer from a 
severe loss power in certain important situations. More precisely these tests may 
be unable to detect some important alternatives as periodic changes or when the 
covariance structure is close to zero, so that its sign likely changes. Noting that 
the previous tests are not intended to handle data with non constant unconditional 
variance, a new approach for testing instantaneous causality taking into account 
non-stationary unconditional variance is proposed in this paper. It is however found 
that the asymptotic distribution of the modified statistic is non standard involv- 
ing the unknown non constant variance structure in a functional form. When the 
asymptotic distribution is non standard, the wild bootstrap method is widely used 
in the literature for the analysis of time series possibly displaying (unconditional) 
heteroscedasticity /dependence (see e.g. Gongalvez and Kilian (2004), Horowitz, Lo- 
bato, Nankervis and Savin (2006) or Inoue and Kilian (2002)). Therefore a wild 
bootstrap procedure is provided for testing zero restrictions on the non constant 
variance structure. It is established through theoretical and empirical results that 
the modified test is preferable to the tests based on the spurious assumption of 
constant unconditional variance. 

The plan of the paper is as follows. In the next section, we introduce the VAR 
models with non constant variance. In section [3] the testing problem for instanta- 
neous causality between subvectors of a VAR process with non constant variance 
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is discussed. The asymptotic properties of the tests based on the assumption of 
constant unconditional variance are presented. It emerges from this part that this 
kind of tests should be avoided in our non standard framework. As a consequence a 
test based on the wild bootstrap procedure taking into consideration non constant 
variance is built. The finite sample properties of the tests are investigated in Sec- 
tion |4] by Monte Carlo experiments. We also consider US macroeconomic data to 
illustrate our findings. In section |5} we draw up a conclusion on our results. 

2 Vector autoregressive model with non constant 
variance 

Consider the following VAR model 

Xt = AoiXt.i + ■■■ + AopXt-p + ut (2. 1) 

ut = Htet, 

where Xt G M.'^ and it is assumed that X_p+i, . . . , Xq, Xi, . . . , Xt are observed. The 
d X d dimensional matrices A^i are such that detA{z) ^ for all \z\ < 1, where 
A{z) = ld — X]r=i^oi-2^* with Id the dx d identity matrix. Note that the process [Xt) 
should be formally written in a triangular form, but the double subscript is sup- 
pressed for notational simplicity. In the following assumption we give the structure 
of the variance by using the rescaling approach of Dahlhaus (1997). Tt corresponds 
to the (j-field generated by {e^ : A; < t} and || . ||r is such that || x \r'-= {E \\ x ||'')^/^' 
for a random variable x with || . || the Euclidean norm. 

Assumption Al: (i) The d x d matrices Ht are lower triangular nonsingular 
matrices with positive diagonal elements and satisfy Ht = G{t/T), where the com- 
ponents of the matrix G{r) := {gki{r)} are measurable deterministic functions on 
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the interval (0, 1], such that sup^gj-Q ^ \gki{r)\ < oo, and each satisfies a Lipschitz 
condition piecewise on a finite number of sub-intervals partitioning (0, 1]. The ma- 
trices S(r) = G{r)G{r)' are assumed positive definite for all r in (0, 1]. 
(ii) The process (e^) is a-mixing and such that E{et \ J^t-i) = 0, E{ete[ \ J^t-i) = Id 
and sup^ || ||4^< oo for some fi > 1. 



If we suppose the process (et) Gaussian and that the functions gki{-) are con- 
stant we retrieve the standard case. Nevertheless when the unconditional variance 
is time- varying, it can be expected that the tools developed in the stationary frame- 
work are not valid or can suffer from drawbacks since the tests for instantaneous 
causality are directly based on the variance structure. From the piecewise Lipschitz 
condition abrupt breaks as well as smooth changes are allowed for the unconditional 
variance. In particular the variance may have a periodic behaviour. The frame- 
work given by our assumption is similar to that of numerous papers in the literature 
and encompasses the case of piecewise constant variance structure (see Pesaran and 
Timmerman (2004), Hamori and Tokihisa (1997) or Xu and Phillips (2008) and 
references therein). However since we assumed that E{ete[ \ J^t-i) = Id, the er- 
ror terms cannot display GARCH effects (as for instance second order correlation). 
Therefore the tools proposed in this paper have to be preferably used for relatively 
low frequency variables for which it is commonly admitted that there is no second 
order dynamics (for instance monthly, quarterly or annual macroeconomic data, see 



Section 4.2 below). In such a situation adding a multivariate GARCH structure to 
our model as in Hafner and Linton (2010) can be viewed as too elaborated. The 
tests proposed in Patilea and Rai'ssi (2012) can be used to check if there is no second 
order dynamics within the data. In the framework of Al we are interested in testing 
zero restrictions on the variance structure Sfr). 



Now re-write model (2.1) as follows 



ut = Htet, 

with Xt-i = . . . , Xl_py and 6*0 = vec{(y4oi, . . . , Aop)} where vec(.) is the 

usual column vectorization operator of a matrix and ® is the usual Kronecker prod- 
uct. The parameter vector Oq may be estimated by Ordinary Least Squares (OLS) 
or Adaptive Least Squares (ALS). Properties of these estimators are established 
in Patilea and Raissi (2012) under Al. Denoting by 6 the ALS (or alternatively 
OLS) estimator of 6^0, we introduce the residuals Ut = Xt — (Xj'.i ® /d)^. Note that 
the more efficient ALS estimation method should be preferred for approximating 
the innovations. Otherwise in practice the autoregressive order is unknown, but it 
is important to ensure that the lag length is well adjusted for the analysis of the 
instantaneous causality. For instance if the lag length is chosen too small we get 
correlated residuals with distorted covariance structure. On the other hand a too 
large autoregressive order would imply a large number of parameters to estimate in 
our multivariate model. The goodness-of-fit of model ( 2.1[ ) can be checked by using 



portmanteau tests proposed in Patilea and Raissi (2011) in our non standard frame- 
work. Hence the lag length is assumed well fitted in the sequel. More particularly 
the OLS and ALS estimators are A/T-asymptotically normal if the lag length is well 
adjusted. 

Let Xit and X2t be the subvectors of Xt := {X'^, X!^^)' with respective dimensions 
di and d2 and let be the di x (i2- dimensional upper right block of := E{utu[). 
Our goal is to determine if it exists an instantaneous causality relation between Xu 
and X2t- The next lemma gives some preliminary results and requires to introduce 
some additional notations. Let Ut := (m'^^^, -UgJ', -dt := U2t ® Uu, Vt := vec{uitu'2t — 
Si2) = u^, ® - vec(Si2), Ht := {H'.^H'^J and G{r) := (G'i(r)', G2(r)')' be in 



line with the partition of Xt. We denote by [z] the integer part of a real number 
z. We also denote by =^ the convergence in distribution and — ?■ the convergence in 
probability. 



Lemma 2.1. Under Al we have as T oo 



t=i 



vec{T}\r))dr, (2.2) 



and 

T 

T-i^{;,^A/-(0,l]), (2.3) 
t=i 

where 

Q= f (G'2(r) ®Gi(r))7W (G2(r) ®Gi(r))'c/r- [ vec{^^\r))vec{J:^\r)ydr 
Jo Jo 

and Ml = E{ete^ ® e^e^). In addition we also have 

T-'^Y.^t^ / {G2{r)®Gr{T))dB^{T) (2.4) 
t=i 

where B^{.) is a Brownian Motion with covariance matrix Q := E{ete[ Cfe'J — 
vec{Id)vec{Id)' , with s G [0, 1]. 



Using (2.3) and (2.4) we shall discuss the test for instantaneous causality between 



Xit and X2t assuming spuriously that the unconditional variance is constant and 
propose a new test adapted to our framework in the next section. Nevertheless 



remarks on the result (2.3) must be made. 



Remark 2.1. Let be 



E(r) 



^ S^Hr) Si2fr) ^ 
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in line with the partition of Xt. If we suppose that Il^'^{r) = for all r G (0, 1], which 
corresponds to the case of no instantaneous causality relation between Xu and X2t 
(see the null hypothesis Hq below), it follows that Vt = ^t = ^2* ® Uu and 



(G2(r) ®Gi(r))A^(G2(r)®G'i(r))'dr. 



In such a situation from Lemmas 5.1, 5^, 5^ and the proof of Lemma 2.1, it is 
clear that 



fl^ := T ^''^^U2tu'2t^ Uitu'i^. ^ ^. (2.5) 
t=i 

In particular when the process (ut) is assumed Gaussian with non constant variance 
the expression of simplifies itself into S^^(r) (g) Ti^^(r)dr by using (5.4). 



Remark 2.2. Assume that the process (ut) is i.i.d. Gaussian with (constant) vari- 
ance noted 



^ ^}} ^ 



V 



/ 



which corresponds to the standard case under the hypothesis Hq below. In such a 
case the expression ofQ simplifies itself into S^^^S^-*^. Under the strong assumption 
of i.i.d. Gaussian error process it can be shown that 



St 



T 



U2tU2t 
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(2.6) 



t=i 



t=i 



3 Testing for instantaneous causality 

In the sequel we follow the notations of Liitkepohl (2005). Denote by X2t(l|{Xjt|/i; < 
t}) the optimal one step linear predictor of X2t at the date t — 1, based on the 
information of the past of the process {Xt). Similarly we define the one step linear 
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predictor X2t{l\{Xk\k < t} U {Xu}) based on the past of (Xt) and the present of 
(Xit). It is said that there is no instantaneous hnear causahty between (Xif) and 
(^2*) if 

X2t{l\{Xk\k <t}U {Xu}) = X2t{l\{Xk\k < t}). 

In the case of non constant variance following the assumption Al and, more partic- 
ularly, because we assumed that the HtS are lower triangular nonsingular matrices 
with positive diagonal elements, it can be shown that there is no instantaneous 
causality between X^ and X2t if and only if the S^^^'s are all equal to zero following 
the same arguments to those in Lutkepohl (2005, pp 46-47). Consequently in our 
non standard framework the following pair of hypotheses has to be tested: 

Ho : J:^^{r) = vs //i : J:^\r) for r e [a, b] C [0, 1] with fixed a < b. 

Now if we consider the case where the variance is assumed constant = E„ for all t, 
it is well known that there is no instantaneous causality between X2t and Xu if and 
only if E^^ = with obvious notation. Therefore the following pair of hypotheses is 
tested under standard assumptions: 

-.El'^O vs H[ : E]^ ^ 0, 

The block Y}^ is usually estimated by ^Y^t=\ ^"^^^2% which converges in probability 
to Jq E^^(r)o?r under Al. Hence such hypothesis testing does not take into account 
the time- varying variance in the sense that it can only be interpreted as a global zero 
restriction testing of the covariance structure, i.e. testing 'E^^(r)dr = against 
the alternative E^^(r)dr 7^ 0. Then Hq and H[ are inappropriate for testing 
instantaneous causality in our non standard framework. 
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It is interesting to point out that Hq is a particular case of H'q, i.e. Hq C Hq, since 
Hq corresponds to S^^(r)(ir = 0. On the other hand since S^^(r)(ir 7^ imphes 
that Si2(r) ^ 0, then H[ C H^. More precisely, if S^^^r) 7^ for r G [a, b] C [0, 1], we 
may have either S^^(r)(ir 7^ 0, which corresponds to Hi r\H[, or S^^(r)(ir = 0, 
which corresponds to Hi fl H'q. Note that we have Hi = {Hi fl i/j) U {Hi fl /Jq) and 
{Hi n /JJ) n (/Ji n Hq) = 0. it is shown in the next part that the case Hi fl Hq 
entails a loss of power for tests built on the assumption of constant unconditional 
variance of the innovations. 



3.1 Tests based on the assumption of constant error vari- 
ance 

In this section the consequences of non constant variance on the instantaneous 
causality tests based on the spurious assumption of a stationary process are an- 
alyzed. Let be 6t '■= T~2 J2t=i where we recall that i^t = ^Uit- The standard 
test statistic is given by 



Sst — Sri^st) ^^T, 



where Clgt is defined in (2.6). Under Al it can be shown that Cist Jq T?'^{r)dr ® 
Jq Ti^^{r)dr =: ilst and we obviously have 7^ il^t in general. 

If the practitioner (spuriously) assumes that the error process is iid but not 
Gaussian, uu and U2t could be dependent and the following statistic with White 
type correction should be used: 



Sw — S'rp{^lw) ^St, 



where the weight matrix is defined in (2.5). Recall that fi^^, is a consistent 
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estimator of f2 under Hq and it is clear from the proof of Lemma 



2.1 



that 



Q^-^Q+ [ veciJ:^\r))Yec{J:^\r)ydr (3.1) 
Jo 

under Hi. 

Finally the practitioner may again (spuriously) suppose that the error process is 
stationary and that the observed heteroscedasticity is a consequence of the presence 
of nonlinearities. However note that the assumed heteroscedasticity is only condi- 
tional while the unconditional variance is still constant in this case. This kind of sit- 
uation can arise if we (spuriously) assume that the innovations process is driven by a 
GARCH model or any other model displaying nonlinearities such as models driven 
by hidden Markov chains or All-Pass models (see Amendola and Francq (2009)). 
Note that the test proposed in Sanso, Arago and Carrion (2004) could be used to 
detect changes in the unconditional variance. In such a case HAC type weight ma- 
trices should be used in the test statistic. For simplicity we focus on the VARHAC 
weight matrix (see Den Haan and Levin (1997)). Denote by Am,i, ■ ■ ■ , Am,m the 
coefficients of the LS regression of on dt-i, ■ ■ ■ ,'^t-m, taking = ior t < 0. 
Introduce Zm,t the residuals of such a regression and flh = -4.(1)^^S2^(1)^^ where 
^(1) = Id^d2 - YlT=i'^rn,k and = T"^ XlLi ^"^.t^^,*- The order m can be chosen 
by using an information criterion. The following statistic involving VARHAC type 
weight matrix may be used 

Sh = s'j^{Qhy-^ST. 

Since we assumed that the autoregressive order p is well adjusted (or known), the 
process = U2t ® uu is uncorrelated and it can be shown that the Am,kS converge 
to zero in probability. Therefore Clh ^ ^ under Hq and 
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nh^n+ I vec(S^2(r))vec(Si2(r))'dr, (3.2) 
Jo 

under Hi, so that Cl^ and VLyj are asymptotically equivalent in the framework of Al. 
This is not surprising since second order dynamics are in fact excluded in Al. 

In this part the asymptotic properties of the above statistics are investigated. 
The asymptotic behavior of the statistics in our non standard framework is first 



established under Hq. The results are direct consequences of (2.3) 



Proposition 1. Assume that Hq hold. Then under Al we have as T ^ oo 

did2 

(3.3) 

where the Zj 's are independent A/'(0, 1) variables, and Ai, . . . , \d^d2 o^f^^ the eigenval- 

_ 1 _ 1 

ues of the matrix Q^^'^QQ^^'^ . In addition we also have 

For a fixed asymptotic level a, the standard test iWst hereafter) consists in 
rejecting the hypothesis of no instantaneous causality between Xn and if Sgt > 
Xdid2,i~a where Xdid2,i-a is the (1 - a)th quantile of the Xdid2 i^^- Therefore it 



appears from (3.3) that the standard test is not able to control the type I error 
since flgt 7^ in general. Denote by (resp. Wh) the test consisting to reject the 
hypothesis of no instantaneous causality if > Xdid2i-a (resp. Sh > Xdid2i-a)- 



From (3.4) we see that the Ww and Wh tests should have good size properties for 
large enough T. 

Now we turn to the study of the power properties of the Wst, Ww and Wh tests 
in the cases Hi fl H'l and Hi fl Hq. We first consider the situation Hi fl i/J, which 
corresponds to the case /q T}'^{r)dr ^ 0. From tjA it is clear that T-'^S,, with 
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i = st, w, h, converge in probability to strictly positive constants if S^^(r)(ir 7^ 0. 
It follows that the Sst, and Sh statistics grow to infinity as fast as T. Therefore 
we can expect that the tests based on the assumption of constant variance will detect 
a possible instantaneous causality for large enough sample sizes when Hi fl H[ hold. 
The abilities of the Wgt^ and Wh tests to detect the case S^^(r)(ir 7^ are com- 
pared considering the approximate Bahadur slope approach (Bahadur (I960)). For 
the test based on the Sst statistic define qst{x) = P ^j^j ^ ^) x > 0. 

For a fixed alternative such that zu = S^^(r)(ir 7^ 0, consider the asymptotic slope 
Csti'cu) = 2\imT-^ooT~^qst{Sst)- Define similarly c^i^) and Chi^w) for the W^, Wh 
tests and also the asymptotic relative efficiencies AREs^,Sst{^) = Cw{-u7)/cst{'^) and 
AREs^^Ssti^) = Chi'co) / Cst{'co). A relative efficiency y4i?ii^s^ ^5^^(07) > 1 suggests that 
the Wh test is more able to detect the case Y}'^{r)dr ^ than the test based on 
the Sst statistic. In such a case the p-values of the Wh test converge faster to zero 
than those of the Wst test. 



Proposition 2. Under Al we have AREs^^Ssti^) — ^ '^^^ ^REsh-Ssti^) — ^ /^^ 
every alternative such that w = S^^(r)(ir ^ 0. 



The proof of Proposition [2] is similar to the proof of Proposition 5.3 in Patilea 
and Raissi (2012) and is then omitted. When the errors are assumed iid Gaussian 
we obtain AREs^^Ssti^) — 1 ^^'^ ^-^-^5h,s'st('^) — 1- Nevertheless if the variance 
structure of the errors is non constant with S^^(r)(ir 7^ 0, the Wh or W^ tests 
achieve a gain in power when compared to the Wst test. 

Finally we study the ability of the Wst, W^ and Wh tests in detecting instan- 
taneous causality when Hi fl Hq hold, that is Si2(r) ^ and J J:^\r)dr = 0. 



In this case Clh = Op{l) and fi^ = Op{l) from (3.1) and (3.2) and we also have 
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^st = Op{l), while the non centrahty term is T~5 YlJ=i = o(T^). Therefore we 
have Si — Op{T) with i — st,w,h'm the case HiDH'q. When such eventuahty is con- 
sidered, it is clear that the tests based on the assumption of stationary errors may 
suffer from a severe loss of power. This is a consequence of the fact that this kind of 
tests are not intended to take into account time varying variance. The case Hi fl H'q 
can arise in the important case where E^^(r) ^ but close to zero so that E^^(r) 
may have a changing sign. Even when at least one of the components of E^^(r) is 
far from zero, we can have E^^(r)(ir = as for instance in some cases where the 
variance structure is periodic. This can be seen by considering the bivariate case and 
taking E^^(r) = ccos(7rr) or E^^(r) — cl^Q i^{r) — cl-^i ^{r) with c e M. Therefore 
the tests based on the spurious assumption of constant unconditional variance for 
the error must be avoided. 

In summary it is found that the tests based on the White and VARHAC correc- 
tions should control well the type I errors for large enough samples on the contrary 
to the Wst- In addition it appears that in the case of non constant unconditional 
variance and when Hi n H'l hold, the Wh and tests have better power properties 
than the Wgt test. Therefore the Wh and W^, tests should be preferred to the Wgt 
test when the unconditional variance is time- varying. However it is also found that 
the tests based on the assumption of constant variance may suffer from a severe loss 
of power in the important cases where E^^(r)o?r = (or jJ^E^^(r)o?r ~ 0). A 
bootstrap test circumventing this power problem in the case Hi fl Hq is proposed in 
the next part. 
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3.2 A bootstrap test taking into account non constant vari- 



ance 



Introduce 5^ = ^ ^ Yl^i^i with s G [0, 1] and consider the following statistic: 



5*6 = sup \\6s\\l. 
s6[0,l] 



Under Hq and from (2.4) we write 



{G2{r)®G,{T))dB^{r) := K{s), 

where the covariance matrix becomes VL = M. = E{ete[ ® ^t^t)- Therefore under Hq 
we have from the Continuous Mapping Theorem 



S,^ sup ||i^(s)||^, (3.5) 
se[o,i] 

since the functional f{Y) = sup5g[o,i] ll^l-^)!!! is continuous for any Y G D[0, 1], the 
space of cadlag processes on [0,1]. Under Hi we obtain 

[Ts] [Ts] 

t=i t=i 



with defined in (2.4). The first term in the right hand side of (3.6) converges 
to zero in probability, while we have X]l=i ^^6^(2^^) = JJ* vec(S^^(r))(ir + o(l) 
and sup^gjQ;^] || {/g* vec(S^^(r))(ir} = C > 0. Hence we have in such a situation 
Sh = CT + OpiT). 



From (3.5) we see that the asymptotic distribution of Sb under the null Ho is 
non standard and depends on the unknown variance structure and the fourth order 
cumulants of the process (e^) in a functional form. Thus the statistic Sb cannot di- 
rectly be used to build a test and we consider a wild bootstrap procedure to provide 
reliable quantiles for testing the instantaneous causality. In the literature such pro- 
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cedures were used for investigating VAR model specification as in Inoue and Kilian 
(2002) among others. The reader is referred to Davidson and Flachaire (2008) or 
Gongalves and Kihan (2004, 2007) and references therein for the wild bootstrap pro- 
cedure method. For resampling our test statistic we draw B bootstrap sets given by 
''^f ^ •= ^i^^'^t = ^t^^U2t ® Uit, t G {1, . . . , T} and z G {1, . . . , B}, where the univariate 
random variables ^f*"* are taken iid standard Gaussian, independent from (ut). For 
a given ie{l,...,B} set 6i'^ = T''^ and S^''^ = sup^^p,!] Il^^'^ll'- In our 

procedure bootstrap counterparts of the Xt's are not generated and the residuals are 
directly used to generate the bootstrap residuals. This is motivated by the fact that 
zero restrictions are tested on the variance structure, so that we only consider the 



residuals in the test statistic. In addition it is seen from (5.3) that the residuals 
and the errors are asymptotically equivalent. It is also clear that the wild bootstrap 
method is designed to replicate the pattern of non constant variance of the residuals 
in sl^\ More precisely we have under Al 

sup \\K{s)\g (3.7) 

se[o,i] 



where we denote by the weak convergence in probability. A proof of (3.7) is 
provided in the Appendix. Note that we have by construction E*{^l^^'^}t) = even 
when the alternative is true, that is E{dt) 7^ (recall that dt = U2t (S) uu). As a 
consequence the result (3.7) is hold whatever S(r)^^ = or S(r)^^ 7^ 0. 



The Wh test consists in rejecting Hq if the statistic Sb exceeds the (1 — «) quantile 
of the bootstrap distribution. Under Hi with S(r)^^c/r 7^ we note that all the 
statistics considered in this paper increase at the rate T. However when E(r)^^ 7^ 
with S(r)i2c/r ^ 0, we can expect that the Wi, test is more powerful than the 
tests based on the assumption of constant unconditional variance. In such situations 
we may have Sw = Op{T), Sh = Op{T) while again 5*;, = Op{T). If the unconditional 
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variance is constant, that is T,^^{r) = S^^, note that = Op(T), Sh = Op(T) 
and Sb = Op{T). Hence we can expect no major loss of power for the Wb when 
compared to the and Wh tests if the underlying structure of the variance is 



constant. In general since Sb = Op{T) and in view of (3.7), the Wb test is consistent. 
From the above results we can draw the conclusion that the Wb is preferable if the 
unconditional variance is non constant for large enough sample sizes. 



4 Numerical illustrations 

In this section the Wb test is compared to the Wst and Ww tests. The VARHAC 
statistic being asymptotically equivalent to the White statistic under Al, as noted 
above, we did not take into account this test in our comparisons. First the type I 
errors and power properties of the three tests are compared using simulated bivariate 
VAR(l) processes with unconditional time- varying variance. The tests are next 
applied to two macroeconomic data sets. 

4.1 Simulation study 

For our experiments we simulated simple bivariate VAR(l) processes where the 
autoregressive parameters are inspired from those estimated from the money supply 
and inflation in the U.S. data (see section below). The data generating process can 
be written as 



V 



(4.1) 



0.64 -1 
-0.01 0.44^ 

where the innovations are Gaussian with variance structure S(r) respecting the 
assumption Al. Two cases are considered for this structure : 



\ ( \ 



Case 1: Empirical size setting. It does not exist an instantaneous causality 
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relation between Xi t and X2 1 '■ 



Sfrl 







V 



Vr e (0,1]. 



V 



S22(r) 

where Ti^'^{r) = a — cos(6r) and S^^(r) = a + sin(6r) correspond to the non 



constant variances of the innovations. We take a > 1 which represents the 
level of these variances and b their angular frequency. 

Case 2: Empirical power setting. It exists an instantaneous causality 
relation between Xi t and X2 1 '■ 



Efrl 



Vr G (0,1]. 



yS-(r) S^^(r)y 

where T,^'^(r) = csin(27rr) respects the case T.^^(r)dr = with S-'^^(r) 7^ 



almost everywhere on r G (0, 1], and S^^(.), S^^(.) are defined as in Case 1. In 
particular, the constant c will allow to investigate the ability of our modified 
test for detecting such alternative when it gets closer to the null hypothesis. 

Note that a, b and c have to be chosen to fulfill the positive definite condition 
on S(r) for all r in (0,1]. For instance this property is checked if a = 1.1, 6 = 11 
and I > c > 0. 

The finite sample properties of the tests are assessed by means of the following 



Monte Carlo experiments. For each sample size, 1000 time series following (4.1) are 



generated. The lag length is assumed known and the autoregressive parameters are 
estimated by using the commonly used OLS method. In all our experiments we use 
299 bootstrap iterations for the Wb test. We use processes generated by Case 1 
to shed light on the control of the type I errors of the studied tests. The results 
are reported in Table [TJ On the other hand processes generated by Case 2 are 
considered for the power study. The results are given in Table |2] and Figure [T] Note 
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that in Table [T] and Table [2] we take a = 1.1, 6 = 11 and c = 0.5 while in Figure [T] 
we take several values for c and a = 1.1, 6 = 11. 





Asymptotic nominal level 


1% 


5% 


10% 


Wst 




Wt 


Wst 








w^ 


Wk 


Sample size 


50 


0.010 


0.007 


0.007 


0.048 


0.050 


0.047 


0.102 


0.113 


0.105 


100 


0.009 


0.009 


0.010 


0.057 


0.066 


0.066 


0.095 


0.096 


0.107 


200 


0.011 


0.010 


0.011 


0.045 


0.047 


0.052 


0.101 


0.116 


0.122 


500 


0.011 


0.010 


0.010 


0.042 


0.047 


0.050 


0.099 


0.101 


0.101 


1000 


0.008 


0.009 


0.010 


0.056 


0.051 


0.051 


0.088 


0.097 


0.103 



Table 1: The empirical size for the studied tests with asymptotic nominal level 1%, 
5%, 10% and a = 1.1, 6 = 11, c = 0.5. 





Asymptotic nominal level 


1% 


5% 


10% 


Wst 


Wn, 


Wb 


Wst 


Wn, 


Wb 


Wst 


Wn, 


Wb 


Sample size 


50 


0.014 


0.005 


0.003 


0.056 


0.040 


0.045 


0.110 


0.085 


0.132 


100 


0.012 


0.005 


0.012 


0.056 


0.038 


0.102 


0.098 


0.079 


0.213 


200 


0.017 


0.010 


0.076 


0.063 


0.048 


0.305 


0.106 


0.093 


0.512 


500 


0.011 


0.006 


0.486 


0.050 


0.038 


0.837 


0.105 


0.080 


0.930 


1000 


0.015 


0.011 


0.966 


0.056 


0.045 


0.997 


0.108 


0.088 


1.000 



Table 2: The empirical power for the studied tests based on asymptotic nominal 
levels 1%, 5%, 10% and a = 1.1, 6 = 11, c = 0.5. 

In our example the Wst, and Wb tests seem to control the type I errors 
reasonably well (see Table ig. We can remark that the standard test provides similar 
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results as compared to the other tests. Nevertheless this outcome does not have to 



be generalized in view of (3.3). In addition recall from Proposition |2j that the Wgt 
is less powerful than the and Wh tests. Now if we turn to the alternative given 
by Case 2, Table [2] clearly shows that the Wst and tests have no power as the 
sample sizes increase on the contrary of the Wb test. This confirms the theoretical 
results obtained when Tj^'^(r)dr ^ 0. For instance the Wi, test is almost always 
rejecting the null hypothesis Hq for a sample size of 1000, while the Wst and 
tests are completely not able to detect the alternative in this case. 

In the above power experiments the changes of S^^(r) around zero were fixed by 
a constant c. In this part we illustrate the ability of the tests to detect departures 
from the null hypothesis S^^(r) = 0, while we again have Ti^'^{r)dr = in all 
situations. Figure [T] represents the power of the three tests when the parameter c 
takes several values, while the sample is fixed T = 500. We clearly observe that the 
relative rejection frequencies of the Wb test increases when the covariance structure 
7^ goes away from zero but verifying S^^(r)(ir = 0. On the other hand 
we again remark that the relative rejection frequencies of the tests based on the 
assumption of constant variance remain close to the asymptotic nominal level even 
when c takes large values. 

4.2 Application to macroeconomic data sets 

In this part we compare the Wst and W^ tests with the Wb test by investigating 
instantaneous causality relationships in U.S. macroeconomic data sets. 

4.2.1 Money supply and inflation in the U.S.A. 

The relationship between money supply and inflation is fundamental in the macroe- 
conomic theories explaining the influence of monetary policy on economy. For in- 
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Figure 1: The empirical power of the Wb, Wgt and tests for fixed sample size 
T — 500 and varying c parameter. The asymptotic nominal level is 5% and we take 
a = 1.1, b= 11. 

stance, the quantity theory of money assumes a proportional relationship between 
money supply and the price level. The reader is referred to Case, Fair and Oster 
(2011) or Mankiw and Taylor (2006) concerning the theoretical links which can be 
made between money supply and inflation. Many studies investigate this relation 
from an empirical point of view. Their results are however ambiguous. For in- 
stance, Turnovsky and Wohar (1984) used a simple macro model to investigate the 
relationship and find that the rate of infiation is independent of the monetary growth 
rate in the U.S.A. over the period 1923-1960, while Benderly and Zwick (1985) or 
Jones and Uri (1986) give some evidence of relationship over the respective periods 
1955-1982 and 1953-1984. Here we investigate the hypothesis of an instantaneous 
causal relationship between money supply and inflation in the U.S.A. over the period 
1979-1995. 

The data considered here are the Ml money stock (Ml) and the Producer Price 
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Index for all commodities (PPIaco)- The Ml represents the money supply and 
PPIaco the inflation from the point of view of producers. The Ml index is provided 
by the Board of Governors of the Federal Reserve System while the PPIaco index 
is provided by the US Department of Labor. The data are taken from 04/1979 to 
12/1995 with a monthly frequency and are available on the web site of the Federal 
Reserve Bank of St. Louis (Series ID : Ml and PPIACO). The length of the series 
is T = 200. 

The first differences of the data are considered in the sequel. From Figure [2] 
it appears that the obtained series have non constant variance. We adjusted a 
VAR(l) model to the first differences of the series. The autoregressive order is 
chosen by using portmanteau tests adapted to our non standard framework where 
the variance structure S(r) is time- varying (see Patilea and Raissi (2011) for details). 
The outcomes in Table |3] suggest that the model is well fitted. The estimation of 
the model by the OLS method is given in Table |4j The residuals of this estimation 
are next recovered to implement the tests studied in this paper. Note that we used 
399 bootstrap iterations for the Wb test. 

From Table |5] we see that the p- value of the Wh test is quite different from those of 
the Wst and tests. For instance the null hypothesis of no instantaneous causality 
is rejected by the Wb test for a significance level of 10% while it is accepted by the 
other tests. These observations can be explained by the covariance structure of 
the innovations. Indeed the nonparametric estimation of this covariance structure 
plotted in Figure [3] shows that S^^(r) seems not null over the considered period 
while its seems that S^^(r) ^ 0. 
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1980 1985 1990 1995 



Figure 2: Evolution of the variations AMI and APPIaco- 




r 



Figure 3: The Nadaraya- Watson kernel estimation of the covariance structure S^^(r) 
for the two data sets. The estimator is defined as in Patilea and Raissi (2012) which 
showed that such estimator is consistent under Al unless at the break points. 
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Number of lags 


3 


6 


12 


BPoLS 


0.4384 [1.1198] 


0.8169 [1.9071] 


0.8165 [3.2870] 



Table 3: The p- values of the Box-Pierce test adapted to our non standard framework. 
The corresponding statistics are displayed into brackets. The BPqls corresponds 
to the portmanteau test based on the OLS proxies of the Mj's. 



A 



01 



0.643 [0.064] -1.124 [0.360] 
-0.009 [0.007] 0.439 [0.102] 

Table 4: The OLS estimators of the matrix Aqi (see equation (2.1)) for the adjusted 
VAR(l) model. Standard deviations of the parameters are displayed into brackets. 





Wst 






p- values 


0.268 [1.225] 


0.201 [1.632] 


0.058 [10.54] 



Table 5: The p- values of the Wst, and Wb tests. The corresponding test statistics 
are displayed into brackets. 

4.2.2 Merchandise trade balance and balance on services in the U.S.A. 

The merchandise trade balance and the balance on services can be seen as indicators 
of the economic health of a country. The U.S. merchandise trade balance is the 
account which redraws the value of the exported goods and the value of the imported 
goods. The U.S. balance on services is similarly the account which redraws the 
value of the exported services and the value of the imported services. Here, we 
search to quantify if it exists an instantaneous causality relation between these two 
macroeconomic indicators. The data are provided by the Bureau Analysis of the U.S. 
Department of Commerce and go from the 01/1960 to the 01/2011 with quarterly 
frequency. The length of the series is T = 204. They are available on the web site 
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I I I I I I 

1960 1970 1980 1990 2000 2010 

Figure 4: Evolution of the first differences of the U.S. merchandise trade balance 
and the U.S. balance on services in billion of U.S. dollars. 

of the Federal Reserve Bank of St. Louis (Series ID : BOPBM and BOPBSV). 

Similarly to the first data set, we consider the first differences of the data (see 
Figure |4]). A VAR(2) model is adjusted to the data (estimation results not reported 
here). The adequacy of the model is again checked using portmanteau tests which 
are valid in our framework. The portmanteau test suggests to choose a VAR(2) 
model. Indeed the p-value of the BPqls test is 0.65[5.29] and for 5 autocorrelations 
in the portmanteau statistics (the portmanteau statistic is given into brackets). The 
p-values of the three tests are next computed from the residuals as for the first data 
set. The outcomes displayed in Table [6] show that the considered tests have quite 
different results. In view of the non constant variance of the studied series (see 
Figure Is]), the result corresponding to the Wb test is more reliable. 
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Wst 






p- value 


0.0498 [3.848] 


0.341 [0.907] 


0.441 [190.142] 



Table 6: The p- values of the Wst-, W^ and tests. The corresponding test statistics 
are displayed into brackets. 

5 Conclusion 

In this paper we studied the problem of testing instantaneous causality in the im- 
portant case where the unconditional variance is time- varying. The properties of the 
Wald tests based on the assumption of constant unconditional variance are investi- 
gated in our non standard framework. It emerges that this kind of tests may have 
no power in this non standard framework. As a consequence we proposed a new 
bootstrap test for testing the instantaneous causality hypothesis in the important 
case where the unconditional variance structure is time-varying. In particular we 
found that the proposed bootstrap test is consistent. We illustrated these theoreti- 
cal results through a set of numerical experiments. The outcomes of macroeconomic 
data sets suggest that the Wald test may dehver results which are quite different 
from the bootstrap test. Although our non-standard framework allows for non con- 
stant variance, it assumes that the structural innovations cannot display conditional 
heteroscedasticity. This case could be the object of interesting further researches. 

Appendix 

The following Lemmas are similar to Lemmas 7.2, 7.3 and 7.4 of Patilea and Rai'ssi 
(2012), so that the proofs are omitted. Introduce Vt = vec(tiitti2j — E^) with Ut — 
{u\^, and recall that '&t — U2t ® uu- 
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Lemma 5.1. Under Al we have 



lim^E [viTr]v[Tr]] = (^2 (r ) ® d (r) )>1 (G2 (r) ® Gi (r ) )' - ^;ec(Si2 (r ) ) tie^S^^ (r) )', 
and 

lim E l^^Tr]] = vec(J:^\r)) 
for values r G (0, 1] at which the functions gij{r) are continuous. 



Lemma 5.2. Under Al we have 

T 



t=i t=i 



and 



T 



T 



t=l °° t=l 



(5.1) 



Lemma 5.3. Under Al we have 

T 



t=i -^0 



(G2(r) ® Gi{r))M{G2{r) ® Gi{r)ydr (5.2) 



vec{i:'\r))vec{J:'\r)ydr, 



and 



T ^1 

i=i •''0 



12, 



r)dr. 



Proof of Lemma 2.1 We first give the proof of (2.3). Let us define Ut{9) 



iu[tie),u'2tie)y = Xt- {Xt^i ® ld)0 for any 6 G From the Mean Value 

Theorem we have 



T T 
T"^^M2t®Mit = r"^^M2t®Mu 



(5.3) 



t=l 



t=l 
T 



t=l 



dd' 
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where 9* is between 6 and ^o, and dut{9)/dd' = ~X[_^ ® Id is uncorrelated with ut- 
Hence we write 'Yl^=i'^t = Yl^=i'^t + Op{l) with Vt = vec(Mii-U2t — SP), since 
the estimator 9 is such that VT{9 — 9q) = Op(l). The process (vt) is a martingale 
difference sequence, so that from the Lindeberg central limit theorem Ylt=i "^t 
is asymptotically normal with mean zero. The expression of the covariance matrix 



n can be obtained as follows, using E{ete^\J-'t-i) = Id and Lemmas 5.1 and 5.3 



T T T 

O := hm T-'CoviY, Vt, Vt) = lim T'^ ^ E{v,v',) 



r-5-oo 

t=i t=i t=i 



°° t=i 

T 

hm J2 [E {M,) ® {uuu[,)} - vec(S,i2)vec(S,i2)'] (5.4) 

t=l 
T 

hm ^ [E {{H,,e,e[H',,) ® (i/itQ^ifJ,)} - Yec{J:]')veci^l'y] 
t=l 

T 

hm V [{H2t ® ifu)^ (e^e; ® e.^) (i72t ® i^it)' - vec{J:]')vec{J:l^)'] 



t=l 



(G2(r) ® Gi(r))7W(G2(r) Gi(r))'rfr - / vec(S^2(^))ygp(5.i2^^^y^^^ 



where the identity {E^J){K^L) = {FK)®{J L) is used for matrices of appropriate 



dimensions. The proof of (2.2 ) follow directly from Lemmas 5.1 5.2 5.3 and equation 



(5.3) 



For the proof of (2.4) again note that Vt can be replaced by Vt from (5.3). We 
write 



Vt = U2t ® Uu - vec(S4 ) 

= Il2tet ® Huet - vec{llull2t) 
= {H2t® Hit){tt®et-Yec{Id)}. 
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Define := et ® — vec{Id). We have E{vl) = and Var{vl) = E{ete[ ® ete[) — 
vec(Jrf)vec(/d)' =: Cl. Therefore from Theorem 3.1 of Hansen (1992) it follows that 



t=i 

for < s < 1. □ 



Proof of (3.7) For the sake of simplicity and with no loss of generality (see (5.3)) 
let us assume that Xt = Ut, so that the error process is observed and there is no 
autoregressive parameters to estimate. Conditionally on the Mj's, di^^ is a Gaussian 
process with independent increments and variance 

[Ts] 

t=l 

where E*{.) is the expectation under the bootstrap probability measure. The result 
follows if 

T-'Y^'^A-^ / (G'2(r)®Gi(r))7W(G2(r)®Gi(r))'rfr, 
t=i -^0 

uniformly for all s E [0,1]. Since T~^Ylt=^i'^t'&t is monotonically increasing and 
the limit function is continuous, it suffices to establish the pointwise convergence 
following Hansen (2000, proof of Lemma A. 10). This holds using similar arguments 



as for (5.1) and (5.2), see Patilea and Raissi (2012) Lemmas 7.3 and 7.4. □ 
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